A Dynamic Group Management Framework for Large-scale Distributed Event Monitoring
نویسنده
چکیده
Distributed event monitoring is an important service for fault, performance and security management. Next generation event monitoring services are higly distributed and invovling a large number of monitoring agents. In order to support scalabel event monitoring, the monitoring agents use IP multicasting as a group communication for exchanging events and control information. However, dueto the dynamic nature of event detection and correlation during the monitoring process, agents groups organization and coordination such as membership and tasks assignments becomes a challenging issue. Thus, supporting an appropriet group mangement infrastrcutre is a key issue for developing scalable event monitoring services. This paper presents an efficient group management framework based on IP multicast that dynamically reconfigure the group strcutures and memebership assignments at run-time according to the event correlation requirements which allows for optimal delivery of multicast messages between the management entities. This framework provides techniques for solving agents’ state synchronization, free-collision group allocation and agents bootstrap problems in distributed event monitoring. The presented framework has been implemented within HiFi monitoring system which is a distributed hierarchical monitoring system.
منابع مشابه
HiFi+: A Monitoring Virtual Machine for Autonomic Distributed Management
Autonomic distributed management enables for deploying self-directed monitoring and control tasks that track dynamic network problems such as performance degradation and security threats. In this paper, we present a monitoring virtual machine interface (HiFi+) that enables users to define and deploy distributed autonomic management tasks using simple Java programs. HiFi+ provides a generic expr...
متن کاملHiFi: A New Monitoring Architecture for Distributed Systems Management
With the increasing complexity of large-scale distributed (LSD) systems, an efficient monitoring mechanism has become an essential service for improving the performance and reliability of such complex applications. This paper presents a scalable, dynamic, flexible and non-intrusive monitoring architecture for managing large-scale distributed (LSD) systems. This architecture, which is is referre...
متن کاملHierarchical Filtering-based Monitoring System for Large-scale Distributed Applications
On-line monitoring of large-scale distributed (LSD) applications is an eeective means to observe the appli-cations' behavior at run-time and provide status information required by debugging and management tools. In this paper, we describe and motivate the architecture and the components design of a scalable, high-performance, dynamic and non-intrusive monitoring system for LSD applications. The...
متن کاملWorkflow management in large distributed systems
The MonALISA (Monitoring Agents using a Large Integrated Services Architecture) framework provides a distributed service system capable of controlling and optimizing large-scale, data-intensive applications. An essential part of managing large-scale, distributed data-processing facilities is a monitoring system for computing facilities, storage, networks, and the very large number of applicatio...
متن کاملA Case For Lightweight Dynamic Event Based Monitoring and Management Support For Large Scale DataCenters
Future large scale systems, such as cloud datacenters, with increased core counts will soon result in infrastructures with millions of cores. This poses challenges for monitoring and management not yet met by existing experimental or commercial software systems. At the core of this challenge is to perform continuous and on-demand monitoring queries over distributed aggregated data resulting fro...
متن کامل